Developing a Protein Interaction Prediction Algorithm on HPC

نویسندگان

  • Wael S. Afifi
  • Ali A. El-Moursy
  • Salwa Nassar
چکیده

The prediction of protein-protein interaction is one of the fundamental problems in bioinformatics. A novel algorithm called STRIKE has shown to achieve good performance in protein-protein interaction prediction. It assumes that proteins interact if they contain similar substrings of amino acids. In this paper, we developed a parallel STRIKE algorithm and we implemented our proposal on Cluster system. Using short protein sequence sets, the overall execution time of a parallel implementation of this bioinformatics algorithm was decreased to about 5 times when increasing number of nodes from one compute node to 6 parallel nodes. Key optimizations to the implementation are also discussed. Keywords— protein-protein sequence matching; parallel computing; performance analysis; HPC computing; sequence

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks

Background: Prediction of the protein localization is among the most important issues in the bioinformatics that is used for the prediction of the proteins in the cells and organelles such as mitochondria. In this study, several machine learning algorithms are applied for the prediction of the intracellular protein locations. These algorithms use the features extracted from pro...

متن کامل

Discovering Domains Mediating Protein Interactions

Background: Protein-protein interactions do not provide any direct information re‌garding the domains within the proteins that mediate the interactions. The majority of proteins are multi domain proteins and the interaction between them is often defined by the pairs of their domains. Most of the former studies focus only on interacting do‌main pairs. However they do not consider the in...

متن کامل

Prediction of Coffee Effects in Rats with Healthy and NAFLD Conditions Based on Protein-Protein Interaction Network Analysis

Background and objectives: Non-alcoholic fatty liver disease (NAFLD) is a common liver condition. On the other hand, coffee consumption has shown promising for gastrointestinal diseases.  Detection of the most valuable biomarkers of decaffeinated coffee treatment in healthy and non-alcoholic fatty liver disease conditions was the aim of the present study. Methods:</stro...

متن کامل

Inverse protein folding in 3D hexagonal prism lattice under HP model

The inverse protein folding problem is that of designing an amino acid sequence which has a prescribed native protein fold. This problem arises in drug design where a particular structure is necessary to ensure proper protein-protein interactions. Previously, tubular structures for a three-dimensional (3D) hexagonal prism lattice were introduced and their stability was formally proved for simpl...

متن کامل

Developing a Dynamic Regression Model for Predicting Future Operating Cash Flow

The purpose of this research is to develop a dynamic regression model for prediction of future operating cash flows of firms accepted in Tehran Stock Exchange. So, the information of 250 companies were considered during 2004 to 2017. In this study, operational and economic variables were added to the fundamental model of Bart, Cram and Nelson (BCN). Due to the simultaneous effect of sales growt...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012